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2 Information extraction: Is question answering an acquired skill? 

Ganesh Ramakrishnan, Soumen Chakrabarti, Deepa Paranjpe, Pushpak Bhattacharya 
May 2004 Proceedings of the 13th international conference on World Wide Web 

Full text available: '| |pdf(260.13 KB) Additional Information: full citation , abstract , references , index terms 

We present a question answering (QA) system which learns how to detect and rank answer 
passages by analyzing questions and their answers (QA pairs) provided as training data. We 
built our system in only a few person-months using off-the-shelf components: a part-of- 
speech tagger, a shallow parser, a lexical network, and a few well-known supervised 
learning algorithms. In contrast, many of the top TREC QA systems are large group efforts, 
using customized ontologies, question classifiers, and highl ... 

Keywords: machine learning, question answering 



3 Database theory, technology and applications (DTTA): Simplified access to structured U 
databases by adapting keyword search and database selection 
Mohammad Hassan, Reda Alhajj, Mick J. Ridley, Ken Barker 

March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: ^ pdf(219.19 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a tool that enables non-technical (naive) end-users to use free-form 
queries in exploring distributed relational databases with simple and direct technique, in a 
fashion similar to using search engines to search text files on the web. This allows web 
designers and database developers to publish their databases for web browsers exploring. 
The proposed approach can be used for both Internet and Intranet application areas. Our 
approach depends on identifying first databases that ... 
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Damien K. Fisher, Franky Lam, William M. Shui, Raymond K. Wong 

November 2003 Proceedings of the twelfth international conference on Information and 
knowledge management 

Full text available: ^pdfd 27.44 KB) Additional Information: full citation , abstract , references , index terms 

With the increasing popularity of XML, there arises the need for managing and querying 
information in this form. Several query languages, such as XQuery, have been proposed 
which return their results in document order. However, most recent efforts focused on 
query optimization have disregarded order. This paper presents a simple yet elegant 
method to maintain document ordering for XML data. Analysis of our method shows that it 
is indeed efficient and scalable, even for changing data. 

Keywords: XML, dynamic, order maintenance, semi-structured data 
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Intelligent web information access: Answering imprecise database queries: a novel 
approach 

Ullas Nambiar, Subbarao Kambhampati 

November 2003 Proceedings of the 5th ACM international workshop on Web 
information and data management 

Full text available- IB pdff 385.91 KB) Additional Information: full citation , abstract, references , citings, index 
^ — terms 

A growing number of databases especially those published on the Web are becoming 
available to external users. Users of these databases are provided simple form-based query 
interfaces that hide the underlying schematic details. Constrained by the expressiveness of 
the query interface users often have difficulty in articulating a precise query over the 
database. Supporting imprecise queries over such systems would allow users to quickly find 
relevant answers without iteratively refining th ... 

Keywords: imprecise queries, query, relational database, similarity 



XML and text: XRANK: ranked keyword search over XML documents 
Lin Guo, Feng Shao, Chavdar Botev, Jayavel Shanmugasundaram 
June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available- ^pdf(265.38 KB) Additional Information: full citation , abstract, references , citings, index 
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We consider the problem of efficiently producing ranked results for keyword search queries 
over hyperlinked XML documents. Evaluating keyword search queries over hierarchical XML 
documents, as opposed to (conceptually) flat HTML documents, introduces many new 
challenges. First, XML keyword search queries do not always return entire documents, but 
can return deeply nested XML elements that contain the desired keywords. Second, the 
nested structure of XML implies that the notion of ranking is no I ... 

Link-based ranking 1: Scaling personalized web search 
Glen Jeh, Jennifer Widom 

May 2003 Proceedings of the twelfth international conference on World Wide Web 
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Recent web search techniques augment traditional text matching with a global notion of 
"importance" based on the linkage structure of the web, such as in Google's PageRank 
algorithm. For more refined searches, this global notion of importance can be specialized to 
create personalized views of importance— for example, importance scores can be biased 
according to a user-specified set of initially-interesting pages. Computing and storing all 
possible personalized views in advance is impractical, as ... 

Keywords: PageRank, web search 

8 A multi-paradigm querying approach for a generic multimedia database management Q 
system 

Ji-Rong Wen, Qing Li, Wei-Ying Ma, Hong-Jiang Zhang 
March 2003 ACM SIGMOD Record, Volume 32 issue l 

Full text available: ^ pdf(524.08 KB) Additional Information: full citation , abstract , references , citings 

To truly meet the requirements of multimedia database (MMDB) management, an 
integrated framework for modeling, managing and retrieving various kinds of media data in 
a uniform way is necessary. MediaLand is an experimental MMDB platform being developed 
at Microsoft Research Asia for users with different levels of experiences and expertise to 
manage and search multimedia repositories easily, efficiently, and cooperatively. Key 
features of MediaLand include a uniform data model for describi ... 

Keywords: media independence, multi-paradigm querying, multimedia database 
management, uniform data modeling 

9 Web search 1 : Searching web databases by structuring keyword-based queries Q 
Pavel Calado, Altigran S. da Silva, Rodrigo C. Vieira, Alberto H. F. Laender, Berthier A. Ribeiro- 
Neto 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available - "Rl odf(204 22 KB) Additional Information: full citation , abstract , references , citings , index 



terms 

On-line information services have become widespread in the Web nowadays. However, Web 
users are non-specialized and have a great variety of interests. Thus, interfaces for Web 
databases must be simple and uniform. In this paper we present an approach, based on 
Bayesian networks, for querying Web databases using keywords only. According to this 
approach, the user inputs a query through a simple search-box interface. From the input 
query, one or more plausible structured queries are derived and su ... 

Keywords: query structuring, structured queries, web databases 



10 Semistructured Data: Structural proximity searching for large collections of semi- 
structured data 
Michael Barg, Raymond K. Wong 

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Full text available" f ^j pdf(1.92 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

The richness of the XML data format allows data to be structured in a way which precisely 
captures the semantics required by the author. It is the structure of the data, however, 
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which forms the basis of all XML query languages. Without at least some notion of the 
structure, a user cannot meaningfully query the data. This problem is compounded when 
one considers that heterogeneous data adhering to different schema are likely to exist in 
the database(s) being queried. This paper proposes a soluti ... 

11 External memory algorithms and data structures: dealing with massive data 
Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), Volume 33 issue 2 

Full text available- pdf{828 46 KB) Additional Information: full citation , abstract , references , citings , index 
k^ - * 1 terms 

Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit 
locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



12 Expressive retrieval from XML documents 
Taurai Tapiwa Chinenyanga, Nicholas Kushmerick 

September 2001 Proceedings of the 24th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Additional Information: full citation , abstract , references , citings , index 



Full text available: Wj pdf(400.63 KB) 

t-*-" terms 

The emergence of XML as a standard interchange format for structured documents/data has 
given rise to many XML query language proposals. However, some of these languages do 
not support information retrieval-style ranked queries based on textual similarity. There 
have been several extensions to these query languages to support keyword search, but the 
resulting query languages cannot express queries such as" 'find books and CDs with similar 
titles". Either these extensions u ... 

13 Component selection and matching for IP-based design | 
G. Martin, R. Seepold, T. Zhang, L. Benini, G. De Micheli 

March 2001 Proceedings of the conference on Design, automation and test in Europe 

Full text available: ^) pdf(170.22 KB) Additional Information: full citation , references , citings , index terms 



14 Integrating content search with structure analysis for hypermedia retrieval and Q 
management 

Wen-Syan Li, K. Selguk Candan 

December 1999 ACM Computing Surveys (CSUR) 

Full text available: ^pdf(25.42 KB) Additional Information: full citation , references , index terms 
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Glen Jeh, Jennifer Widom 

August 2004 Proceedings of the 2004 ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: "j^ pdf(255.01 KB) Additional Information: full citation , abstract , references , index terms 

Existing data mining algorithms on graphs look for nodes satisfying specific properties, such 
as specific notions of structural similarity or specific measures of link-based importance. 
While such analyses for predetermined properties can be effective in well-understood 
domains, sometimes identifying an appropriate property for analysis can be a challenge, 
and focusing on a single property may neglect other important aspects of the data. In this 
paper, we develop a foundation for mining the prop ... 



Keywords: data mining, graph mining 



16 Posters: Providing ranked relevant results for web database queries Q 
Ullas Nambiar, Subbarao Kambhampati 

May 2004 Proceedings of the 13th international World Wide Web conference on 
Alternate track papers & posters 

Full text available- fP ) pdf(45.19 KB) Additional Information: full citation, abstract, references, citings, index 
' ^ terms 

Often Web database users experience difficulty in articulating their needs using a precise 
query. Providing ranked set of possible answers would benefit such users. We propose to 
provide ranked answers to user queries by identifying a set of queries from the query log 
whose answers are relevant to the given user query. The relevance detection is done using 
a domain and end-user independent content similarity estimation technique. 

Keywords: content similarity, query suggestion, web-enabled database 



17 Paper session 5: approximate and ranked query processing: Mining approximate | 
functional dependencies and concept similarities to answer imprecise queries 
Ullas Nambiar, Subbarao Kambhampati 

June 2004 Proceedings of the 7th International Workshop on the Web and Databases: 
colocated with ACM SIGMOD/PODS 2004 

Full text available: ^ pdf( 195.43 KB) Additional Information: full citation , abstract , references 

Current approaches for answering queries with imprecise constraints require users to 
provide distance metrics and importance measures for attributes of interest. In this paper 
we focus on providing a domain and end-user independent solution for supporting imprecise 
queries over Web databases without affecting the underlying database. We propose a query 
processing framework that integrates techniques from IR and database research to 
efficiently determine answers for imprecise queries. We m ... 

Keywords: approximate functional dependencies, imprecise queries, tuple similarity 



18 Research sessions: text and DB: On the integration of structure indexes and inverted Q 
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Raghav Kaushik, Rajasekar Krishnamurthy, Jeffrey F. Naughton, Raghu Ramakrishnan 
June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Full text available: | ^ pdf(228.17 KB) Additional Information: full citation , abstract , references , citings 

Several methods have been proposed to evaluate queries over a native XML DBMS, where 
the queries specify both path and keyword constraints. These broadly consist of graph 
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traversal approaches, optimized with auxiliary structures known as structure indexes; and 
approaches based on information-retrieval style inverted lists. We propose a strategy that 
combines the two forms of auxiliary indexes, and a query evaluation algorithm for 
branching path expressions based on this strategy. Our technique i ... 
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